Dataset statistics
| Number of variables | 19 |
|---|---|
| Number of observations | 52325 |
| Missing cells | 349292 |
| Missing cells (%) | 35.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 7.6 MiB |
| Average record size in memory | 152.0 B |
Variable types
| Text | 2 |
|---|---|
| Categorical | 3 |
| Numeric | 13 |
| Boolean | 1 |
MRG has constant value "" | Constant |
TENURE is highly imbalanced (87.4%) | Imbalance |
REGION has 20576 (39.3%) missing values | Missing |
MONTANT has 18400 (35.2%) missing values | Missing |
FREQUENCE_RECH has 18400 (35.2%) missing values | Missing |
REVENUE has 17687 (33.8%) missing values | Missing |
ARPU_SEGMENT has 17687 (33.8%) missing values | Missing |
FREQUENCE has 17687 (33.8%) missing values | Missing |
DATA_VOLUME has 25647 (49.0%) missing values | Missing |
ON_NET has 19155 (36.6%) missing values | Missing |
ORANGE has 21760 (41.6%) missing values | Missing |
TIGO has 31265 (59.8%) missing values | Missing |
ZONE1 has 48134 (92.0%) missing values | Missing |
ZONE2 has 48965 (93.6%) missing values | Missing |
TOP_PACK has 21963 (42.0%) missing values | Missing |
FREQ_TOP_PACK has 21963 (42.0%) missing values | Missing |
DATA_VOLUME is highly skewed (γ1 = 32.65507376) | Skewed |
user_id has unique values | Unique |
DATA_VOLUME has 7822 (14.9%) zeros | Zeros |
ON_NET has 2637 (5.0%) zeros | Zeros |
ORANGE has 1553 (3.0%) zeros | Zeros |
TIGO has 2234 (4.3%) zeros | Zeros |
ZONE1 has 1460 (2.8%) zeros | Zeros |
ZONE2 has 989 (1.9%) zeros | Zeros |
Reproduction
| Analysis started | 2024-04-24 12:54:45.505045 |
|---|---|
| Analysis finished | 2024-04-24 12:55:33.132735 |
| Duration | 47.63 seconds |
| Software version | ydata-profiling vv4.7.0 |
| Download configuration | config.json |
user_id
Text
UNIQUE 
| Distinct | 52325 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 408.9 KiB |
Length
| Max length | 40 |
|---|---|
| Median length | 40 |
| Mean length | 40 |
| Min length | 40 |
Characters and Unicode
| Total characters | 2093000 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 52325 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 00000bfd7d50f01092811bc0c8d7b0d6fe7c3596 |
|---|---|
| 2nd row | 00000cb4a5d760de88fecb38e2f71b7bec52e834 |
| 3rd row | 00001654a9d9f96303d9969d0a4a851714a4bb57 |
| 4th row | 00001dd6fa45f7ba044bd5d84937be464ce78ac2 |
| 5th row | 000028d9e13a595abe061f9b58f3d76ab907850f |
| Value | Count | Frequency (%) |
| 00000bfd7d50f01092811bc0c8d7b0d6fe7c3596 | 1 | < 0.1% |
| 0000527d276a6ba8b02810cc2c1d60d25e650f5f | 1 | < 0.1% |
| 0000cd42663b7542ccac678690d07c73179a5268 | 1 | < 0.1% |
| 0000c0f0fd1a7b922b099a0c5434fa5fff9a6f44 | 1 | < 0.1% |
| 00001654a9d9f96303d9969d0a4a851714a4bb57 | 1 | < 0.1% |
| 00001dd6fa45f7ba044bd5d84937be464ce78ac2 | 1 | < 0.1% |
| 000028d9e13a595abe061f9b58f3d76ab907850f | 1 | < 0.1% |
| 0000296564272665ccd2925d377e124f3306b01e | 1 | < 0.1% |
| 00002b0ed56e2c199ec8c3021327229afa70f063 | 1 | < 0.1% |
| 0000313946b6849745963442c6e572d47cd24ced | 1 | < 0.1% |
| Other values (52315) | 52315 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 185601 | 8.9% |
| 3 | 133397 | 6.4% |
| 1 | 133026 | 6.4% |
| 2 | 132967 | 6.4% |
| 4 | 132817 | 6.3% |
| 5 | 132422 | 6.3% |
| 6 | 126317 | 6.0% |
| c | 124364 | 5.9% |
| 7 | 124289 | 5.9% |
| 9 | 124276 | 5.9% |
| Other values (6) | 743524 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1349220 | |
| Lowercase Letter | 743780 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 185601 | |
| 3 | 133397 | |
| 1 | 133026 | |
| 2 | 132967 | |
| 4 | 132817 | |
| 5 | 132422 | |
| 6 | 126317 | |
| 7 | 124289 | |
| 9 | 124276 | |
| 8 | 124108 |
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 124364 | |
| b | 124269 | |
| d | 123947 | |
| e | 123902 | |
| a | 123836 | |
| f | 123462 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1349220 | |
| Latin | 743780 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 185601 | |
| 3 | 133397 | |
| 1 | 133026 | |
| 2 | 132967 | |
| 4 | 132817 | |
| 5 | 132422 | |
| 6 | 126317 | |
| 7 | 124289 | |
| 9 | 124276 | |
| 8 | 124108 |
Latin
| Value | Count | Frequency (%) |
| c | 124364 | |
| b | 124269 | |
| d | 123947 | |
| e | 123902 | |
| a | 123836 | |
| f | 123462 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2093000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 185601 | 8.9% |
| 3 | 133397 | 6.4% |
| 1 | 133026 | 6.4% |
| 2 | 132967 | 6.4% |
| 4 | 132817 | 6.3% |
| 5 | 132422 | 6.3% |
| 6 | 126317 | 6.0% |
| c | 124364 | 5.9% |
| 7 | 124289 | 5.9% |
| 9 | 124276 | 5.9% |
| Other values (6) | 743524 |
REGION
Categorical
MISSING 
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 20576 |
| Missing (%) | 39.3% |
| Memory size | 408.9 KiB |
| DAKAR | |
|---|---|
| THIES | |
| SAINT-LOUIS | |
| LOUGA | |
| KAOLACK | |
| Other values (9) |
Length
| Max length | 11 |
|---|---|
| Median length | 5 |
| Mean length | 6.3128602 |
| Min length | 5 |
Characters and Unicode
| Total characters | 200427 |
|---|---|
| Distinct characters | 20 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | FATICK |
|---|---|
| 2nd row | DAKAR |
| 3rd row | DAKAR |
| 4th row | LOUGA |
| 5th row | LOUGA |
Common Values
| Value | Count | Frequency (%) |
| DAKAR | 12526 | |
| THIES | 4457 | 8.5% |
| SAINT-LOUIS | 2840 | 5.4% |
| LOUGA | 2408 | 4.6% |
| KAOLACK | 2351 | 4.5% |
| DIOURBEL | 1652 | 3.2% |
| TAMBACOUNDA | 1326 | 2.5% |
| KAFFRINE | 1053 | 2.0% |
| FATICK | 920 | 1.8% |
| KOLDA | 889 | 1.7% |
| Other values (4) | 1327 | 2.5% |
| (Missing) | 20576 |
Length
| Value | Count | Frequency (%) |
| dakar | 12526 | |
| thies | 4457 | 14.0% |
| saint-louis | 2840 | 8.9% |
| louga | 2408 | 7.6% |
| kaolack | 2351 | 7.4% |
| diourbel | 1652 | 5.2% |
| tambacounda | 1326 | 4.2% |
| kaffrine | 1053 | 3.3% |
| fatick | 920 | 2.9% |
| kolda | 889 | 2.8% |
| Other values (4) | 1327 | 4.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 43212 | |
| K | 20117 | |
| D | 16489 | 8.2% |
| R | 15777 | 7.9% |
| I | 14923 | 7.4% |
| O | 12135 | 6.1% |
| T | 10228 | 5.1% |
| S | 10206 | 5.1% |
| L | 10140 | 5.1% |
| U | 8895 | 4.4% |
| Other values (10) | 38305 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 197587 | |
| Dash Punctuation | 2840 | 1.4% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 43212 | |
| K | 20117 | |
| D | 16489 | 8.3% |
| R | 15777 | 8.0% |
| I | 14923 | 7.6% |
| O | 12135 | 6.1% |
| T | 10228 | 5.2% |
| S | 10206 | 5.2% |
| L | 10140 | 5.1% |
| U | 8895 | 4.5% |
| Other values (9) | 35465 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2840 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 197587 | |
| Common | 2840 | 1.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 43212 | |
| K | 20117 | |
| D | 16489 | 8.3% |
| R | 15777 | 8.0% |
| I | 14923 | 7.6% |
| O | 12135 | 6.1% |
| T | 10228 | 5.2% |
| S | 10206 | 5.2% |
| L | 10140 | 5.1% |
| U | 8895 | 4.5% |
| Other values (9) | 35465 |
Common
| Value | Count | Frequency (%) |
| - | 2840 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 200427 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 43212 | |
| K | 20117 | |
| D | 16489 | 8.2% |
| R | 15777 | 7.9% |
| I | 14923 | 7.4% |
| O | 12135 | 6.1% |
| T | 10228 | 5.1% |
| S | 10206 | 5.1% |
| L | 10140 | 5.1% |
| U | 8895 | 4.4% |
| Other values (10) | 38305 |
TENURE
Categorical
IMBALANCE 
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 408.9 KiB |
| K > 24 month | |
|---|---|
| I 18-21 month | 1042 |
| H 15-18 month | 606 |
| G 12-15 month | 364 |
| J 21-24 month | 317 |
| Other values (4) | 297 |
Length
| Max length | 13 |
|---|---|
| Median length | 12 |
| Mean length | 12.042962 |
| Min length | 3 |
Characters and Unicode
| Total characters | 630148 |
|---|---|
| Distinct characters | 24 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | K > 24 month |
|---|---|
| 2nd row | I 18-21 month |
| 3rd row | K > 24 month |
| 4th row | K > 24 month |
| 5th row | K > 24 month |
Common Values
| Value | Count | Frequency (%) |
| K > 24 month | 49699 | |
| I 18-21 month | 1042 | 2.0% |
| H 15-18 month | 606 | 1.2% |
| G 12-15 month | 364 | 0.7% |
| J 21-24 month | 317 | 0.6% |
| F 9-12 month | 224 | 0.4% |
| E 6-9 month | 50 | 0.1% |
| D 3-6 month | 22 | < 0.1% |
| K > | 1 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| month | 52324 | |
| k | 49700 | |
| 49700 | ||
| 24 | 49699 | |
| i | 1042 | 0.5% |
| 18-21 | 1042 | 0.5% |
| h | 606 | 0.3% |
| 15-18 | 606 | 0.3% |
| 12-15 | 364 | 0.2% |
| g | 364 | 0.2% |
| Other values (8) | 1226 | 0.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 154348 | ||
| m | 52324 | 8.3% |
| o | 52324 | 8.3% |
| n | 52324 | 8.3% |
| t | 52324 | 8.3% |
| h | 52324 | 8.3% |
| 2 | 51963 | 8.2% |
| 4 | 50016 | 7.9% |
| K | 49700 | 7.9% |
| > | 49700 | 7.9% |
| Other values (14) | 12801 | 2.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 261620 | |
| Space Separator | 154348 | |
| Decimal Number | 109530 | |
| Uppercase Letter | 52325 | 8.3% |
| Math Symbol | 49700 | 7.9% |
| Dash Punctuation | 2625 | 0.4% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 51963 | |
| 4 | 50016 | |
| 1 | 4565 | 4.2% |
| 8 | 1648 | 1.5% |
| 5 | 970 | 0.9% |
| 9 | 274 | 0.3% |
| 6 | 72 | 0.1% |
| 3 | 22 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| K | 49700 | |
| I | 1042 | 2.0% |
| H | 606 | 1.2% |
| G | 364 | 0.7% |
| J | 317 | 0.6% |
| F | 224 | 0.4% |
| E | 50 | 0.1% |
| D | 22 | < 0.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| m | 52324 | |
| o | 52324 | |
| n | 52324 | |
| t | 52324 | |
| h | 52324 |
Space Separator
| Value | Count | Frequency (%) |
| 154348 |
Math Symbol
| Value | Count | Frequency (%) |
| > | 49700 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2625 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 316203 | |
| Latin | 313945 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| m | 52324 | |
| o | 52324 | |
| n | 52324 | |
| t | 52324 | |
| h | 52324 | |
| K | 49700 | |
| I | 1042 | 0.3% |
| H | 606 | 0.2% |
| G | 364 | 0.1% |
| J | 317 | 0.1% |
| Other values (3) | 296 | 0.1% |
Common
| Value | Count | Frequency (%) |
| 154348 | ||
| 2 | 51963 | 16.4% |
| 4 | 50016 | 15.8% |
| > | 49700 | 15.7% |
| 1 | 4565 | 1.4% |
| - | 2625 | 0.8% |
| 8 | 1648 | 0.5% |
| 5 | 970 | 0.3% |
| 9 | 274 | 0.1% |
| 6 | 72 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 630148 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 154348 | ||
| m | 52324 | 8.3% |
| o | 52324 | 8.3% |
| n | 52324 | 8.3% |
| t | 52324 | 8.3% |
| h | 52324 | 8.3% |
| 2 | 51963 | 8.2% |
| 4 | 50016 | 7.9% |
| K | 49700 | 7.9% |
| > | 49700 | 7.9% |
| Other values (14) | 12801 | 2.0% |
MONTANT
Real number (ℝ)
MISSING 
| Distinct | 1002 |
|---|---|
| Distinct (%) | 3.0% |
| Missing | 18400 |
| Missing (%) | 35.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5597.3932 |
| Minimum | 50 |
|---|---|
| Maximum | 120870 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 408.9 KiB |
Quantile statistics
| Minimum | 50 |
|---|---|
| 5-th percentile | 250 |
| Q1 | 1000 |
| median | 3000 |
| Q3 | 7500 |
| 95-th percentile | 18800 |
| Maximum | 120870 |
| Range | 120820 |
| Interquartile range (IQR) | 6500 |
Descriptive statistics
| Standard deviation | 7196.8937 |
|---|---|
| Coefficient of variation (CV) | 1.2857581 |
| Kurtosis | 24.589717 |
| Mean | 5597.3932 |
| Median Absolute Deviation (MAD) | 2450 |
| Skewness | 3.5748256 |
| Sum | 1.8989157 × 108 |
| Variance | 51795279 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 500 | 2712 | 5.2% |
| 1000 | 2040 | 3.9% |
| 1500 | 1132 | 2.2% |
| 2000 | 1079 | 2.1% |
| 200 | 1043 | 2.0% |
| 3000 | 828 | 1.6% |
| 2500 | 809 | 1.5% |
| 4000 | 627 | 1.2% |
| 3500 | 583 | 1.1% |
| 100 | 499 | 1.0% |
| Other values (992) | 22573 | |
| (Missing) | 18400 |
| Value | Count | Frequency (%) |
| 50 | 10 | < 0.1% |
| 100 | 499 | |
| 130 | 1 | < 0.1% |
| 150 | 48 | 0.1% |
| 200 | 1043 | |
| 250 | 231 | 0.4% |
| 300 | 289 | 0.6% |
| 350 | 54 | 0.1% |
| 400 | 334 | 0.6% |
| 450 | 60 | 0.1% |
| Value | Count | Frequency (%) |
| 120870 | 1 | |
| 120000 | 1 | |
| 119450 | 1 | |
| 114400 | 1 | |
| 106900 | 1 | |
| 93000 | 1 | |
| 91500 | 2 | |
| 89500 | 1 | |
| 89000 | 1 | |
| 88700 | 1 |
FREQUENCE_RECH
Real number (ℝ)
MISSING 
| Distinct | 100 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 18400 |
| Missing (%) | 35.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.668651 |
| Minimum | 1 |
|---|---|
| Maximum | 106 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 408.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 7 |
| Q3 | 16 |
| 95-th percentile | 40 |
| Maximum | 106 |
| Range | 105 |
| Interquartile range (IQR) | 14 |
Descriptive statistics
| Standard deviation | 13.418135 |
|---|---|
| Coefficient of variation (CV) | 1.1499302 |
| Kurtosis | 5.1744832 |
| Mean | 11.668651 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 2.0924059 |
| Sum | 395859 |
| Variance | 180.04635 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 5369 | 10.3% |
| 2 | 3293 | 6.3% |
| 3 | 2611 | 5.0% |
| 4 | 2130 | 4.1% |
| 5 | 1831 | 3.5% |
| 6 | 1593 | 3.0% |
| 7 | 1352 | 2.6% |
| 8 | 1243 | 2.4% |
| 9 | 1042 | 2.0% |
| 10 | 965 | 1.8% |
| Other values (90) | 12496 | |
| (Missing) | 18400 |
| Value | Count | Frequency (%) |
| 1 | 5369 | |
| 2 | 3293 | |
| 3 | 2611 | |
| 4 | 2130 | 4.1% |
| 5 | 1831 | 3.5% |
| 6 | 1593 | 3.0% |
| 7 | 1352 | 2.6% |
| 8 | 1243 | 2.4% |
| 9 | 1042 | 2.0% |
| 10 | 965 | 1.8% |
| Value | Count | Frequency (%) |
| 106 | 2 | |
| 103 | 1 | < 0.1% |
| 98 | 1 | < 0.1% |
| 97 | 2 | |
| 96 | 1 | < 0.1% |
| 95 | 1 | < 0.1% |
| 94 | 1 | < 0.1% |
| 93 | 2 | |
| 92 | 3 | |
| 91 | 1 | < 0.1% |
REVENUE
Real number (ℝ)
MISSING 
| Distinct | 9678 |
|---|---|
| Distinct (%) | 27.9% |
| Missing | 17687 |
| Missing (%) | 33.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5578.589 |
| Minimum | 1 |
|---|---|
| Maximum | 147739 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 408.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 199 |
| Q1 | 1000 |
| median | 3000 |
| Q3 | 7499 |
| 95-th percentile | 19001 |
| Maximum | 147739 |
| Range | 147738 |
| Interquartile range (IQR) | 6499 |
Descriptive statistics
| Standard deviation | 7296.1927 |
|---|---|
| Coefficient of variation (CV) | 1.3078921 |
| Kurtosis | 25.983209 |
| Mean | 5578.589 |
| Median Absolute Deviation (MAD) | 2499 |
| Skewness | 3.6018265 |
| Sum | 1.9323116 × 108 |
| Variance | 53234428 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 500 | 1411 | 2.7% |
| 1000 | 898 | 1.7% |
| 200 | 504 | 1.0% |
| 1500 | 478 | 0.9% |
| 2000 | 427 | 0.8% |
| 3000 | 326 | 0.6% |
| 2500 | 303 | 0.6% |
| 4000 | 212 | 0.4% |
| 3500 | 205 | 0.4% |
| 100 | 178 | 0.3% |
| Other values (9668) | 29696 | |
| (Missing) | 17687 |
| Value | Count | Frequency (%) |
| 1 | 115 | |
| 2 | 67 | |
| 3 | 8 | < 0.1% |
| 4 | 51 | |
| 5 | 4 | < 0.1% |
| 6 | 25 | < 0.1% |
| 7 | 9 | < 0.1% |
| 8 | 32 | 0.1% |
| 9 | 30 | 0.1% |
| 10 | 76 |
| Value | Count | Frequency (%) |
| 147739 | 1 | |
| 126314 | 1 | |
| 124000 | 1 | |
| 108216 | 1 | |
| 96959 | 1 | |
| 93195 | 1 | |
| 93001 | 1 | |
| 90067 | 1 | |
| 89502 | 1 | |
| 89224 | 1 |
ARPU_SEGMENT
Real number (ℝ)
MISSING 
| Distinct | 5730 |
|---|---|
| Distinct (%) | 16.5% |
| Missing | 17687 |
| Missing (%) | 33.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1859.5349 |
| Minimum | 0 |
|---|---|
| Maximum | 49246 |
| Zeros | 115 |
| Zeros (%) | 0.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 408.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 66 |
| Q1 | 333 |
| median | 1000 |
| Q3 | 2500 |
| 95-th percentile | 6334 |
| Maximum | 49246 |
| Range | 49246 |
| Interquartile range (IQR) | 2167 |
Descriptive statistics
| Standard deviation | 2432.0598 |
|---|---|
| Coefficient of variation (CV) | 1.307886 |
| Kurtosis | 25.983376 |
| Mean | 1859.5349 |
| Median Absolute Deviation (MAD) | 833 |
| Skewness | 3.6018465 |
| Sum | 64410571 |
| Variance | 5914914.8 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 167 | 1625 | 3.1% |
| 333 | 1089 | 2.1% |
| 500 | 668 | 1.3% |
| 67 | 570 | 1.1% |
| 667 | 532 | 1.0% |
| 1000 | 444 | 0.8% |
| 833 | 374 | 0.7% |
| 1333 | 276 | 0.5% |
| 1167 | 272 | 0.5% |
| 233 | 230 | 0.4% |
| Other values (5720) | 28558 | |
| (Missing) | 17687 |
| Value | Count | Frequency (%) |
| 0 | 115 | |
| 1 | 126 | |
| 2 | 38 | 0.1% |
| 3 | 138 | |
| 4 | 73 | |
| 5 | 47 | 0.1% |
| 6 | 22 | < 0.1% |
| 7 | 91 | |
| 8 | 18 | < 0.1% |
| 9 | 26 | < 0.1% |
| Value | Count | Frequency (%) |
| 49246 | 1 | |
| 42105 | 1 | |
| 41333 | 1 | |
| 36072 | 1 | |
| 32320 | 1 | |
| 31065 | 1 | |
| 31000 | 1 | |
| 30022 | 1 | |
| 29834 | 1 | |
| 29741 | 1 |
FREQUENCE
Real number (ℝ)
MISSING 
| Distinct | 91 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 17687 |
| Missing (%) | 33.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 14.111958 |
| Minimum | 1 |
|---|---|
| Maximum | 91 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 408.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 9 |
| Q3 | 20 |
| 95-th percentile | 46 |
| Maximum | 91 |
| Range | 90 |
| Interquartile range (IQR) | 17 |
Descriptive statistics
| Standard deviation | 14.863632 |
|---|---|
| Coefficient of variation (CV) | 1.053265 |
| Kurtosis | 3.3860032 |
| Mean | 14.111958 |
| Median Absolute Deviation (MAD) | 7 |
| Skewness | 1.7760738 |
| Sum | 488810 |
| Variance | 220.92756 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 3942 | 7.5% |
| 2 | 2758 | 5.3% |
| 3 | 2293 | 4.4% |
| 4 | 1983 | 3.8% |
| 5 | 1731 | 3.3% |
| 6 | 1532 | 2.9% |
| 7 | 1425 | 2.7% |
| 8 | 1233 | 2.4% |
| 9 | 1198 | 2.3% |
| 10 | 1031 | 2.0% |
| Other values (81) | 15512 | |
| (Missing) | 17687 |
| Value | Count | Frequency (%) |
| 1 | 3942 | |
| 2 | 2758 | |
| 3 | 2293 | |
| 4 | 1983 | |
| 5 | 1731 | |
| 6 | 1532 | 2.9% |
| 7 | 1425 | 2.7% |
| 8 | 1233 | 2.4% |
| 9 | 1198 | 2.3% |
| 10 | 1031 | 2.0% |
| Value | Count | Frequency (%) |
| 91 | 1 | < 0.1% |
| 90 | 1 | < 0.1% |
| 89 | 7 | |
| 88 | 4 | < 0.1% |
| 87 | 8 | |
| 86 | 9 | |
| 85 | 9 | |
| 84 | 5 | < 0.1% |
| 83 | 12 | |
| 82 | 15 |
DATA_VOLUME
Real number (ℝ)
MISSING  SKEWED  ZEROS 
| Distinct | 7971 |
|---|---|
| Distinct (%) | 29.9% |
| Missing | 25647 |
| Missing (%) | 49.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3548.6737 |
| Minimum | 0 |
|---|---|
| Maximum | 926547 |
| Zeros | 7822 |
| Zeros (%) | 14.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 408.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 276 |
| Q3 | 2981 |
| 95-th percentile | 15388.9 |
| Maximum | 926547 |
| Range | 926547 |
| Interquartile range (IQR) | 2981 |
Descriptive statistics
| Standard deviation | 15318.304 |
|---|---|
| Coefficient of variation (CV) | 4.3166278 |
| Kurtosis | 1545.23 |
| Mean | 3548.6737 |
| Median Absolute Deviation (MAD) | 276 |
| Skewness | 32.655074 |
| Sum | 94671518 |
| Variance | 2.3465043 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 7822 | 14.9% |
| 1 | 970 | 1.9% |
| 2 | 323 | 0.6% |
| 3 | 173 | 0.3% |
| 4 | 158 | 0.3% |
| 1024 | 138 | 0.3% |
| 5 | 112 | 0.2% |
| 6 | 101 | 0.2% |
| 1023 | 94 | 0.2% |
| 7 | 88 | 0.2% |
| Other values (7961) | 16699 | |
| (Missing) | 25647 |
| Value | Count | Frequency (%) |
| 0 | 7822 | |
| 1 | 970 | 1.9% |
| 2 | 323 | 0.6% |
| 3 | 173 | 0.3% |
| 4 | 158 | 0.3% |
| 5 | 112 | 0.2% |
| 6 | 101 | 0.2% |
| 7 | 88 | 0.2% |
| 8 | 69 | 0.1% |
| 9 | 81 | 0.2% |
| Value | Count | Frequency (%) |
| 926547 | 1 | |
| 867127 | 1 | |
| 752018 | 1 | |
| 720309 | 1 | |
| 611581 | 1 | |
| 576214 | 1 | |
| 490458 | 1 | |
| 443833 | 1 | |
| 434373 | 1 | |
| 322763 | 1 |
ON_NET
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 2578 |
|---|---|
| Distinct (%) | 7.8% |
| Missing | 19155 |
| Missing (%) | 36.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 278.24787 |
| Minimum | 0 |
|---|---|
| Maximum | 23595 |
| Zeros | 2637 |
| Zeros (%) | 5.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 408.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 5 |
| median | 27 |
| Q3 | 157 |
| 95-th percentile | 1340 |
| Maximum | 23595 |
| Range | 23595 |
| Interquartile range (IQR) | 152 |
Descriptive statistics
| Standard deviation | 867.60815 |
|---|---|
| Coefficient of variation (CV) | 3.1181124 |
| Kurtosis | 86.87487 |
| Mean | 278.24787 |
| Median Absolute Deviation (MAD) | 26 |
| Skewness | 7.4879948 |
| Sum | 9229482 |
| Variance | 752743.91 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2637 | 5.0% |
| 1 | 2226 | 4.3% |
| 2 | 1389 | 2.7% |
| 7 | 1038 | 2.0% |
| 8 | 996 | 1.9% |
| 3 | 989 | 1.9% |
| 4 | 943 | 1.8% |
| 6 | 728 | 1.4% |
| 5 | 692 | 1.3% |
| 9 | 465 | 0.9% |
| Other values (2568) | 21067 | |
| (Missing) | 19155 |
| Value | Count | Frequency (%) |
| 0 | 2637 | |
| 1 | 2226 | |
| 2 | 1389 | |
| 3 | 989 | 1.9% |
| 4 | 943 | 1.8% |
| 5 | 692 | 1.3% |
| 6 | 728 | 1.4% |
| 7 | 1038 | 2.0% |
| 8 | 996 | 1.9% |
| 9 | 465 | 0.9% |
| Value | Count | Frequency (%) |
| 23595 | 1 | |
| 21480 | 1 | |
| 17400 | 1 | |
| 15175 | 1 | |
| 14972 | 1 | |
| 14645 | 1 | |
| 14506 | 1 | |
| 13683 | 1 | |
| 13167 | 1 | |
| 12864 | 1 |
ORANGE
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 1099 |
|---|---|
| Distinct (%) | 3.6% |
| Missing | 21760 |
| Missing (%) | 41.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 95.388353 |
| Minimum | 0 |
|---|---|
| Maximum | 5841 |
| Zeros | 1553 |
| Zeros (%) | 3.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 408.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 7 |
| median | 29 |
| Q3 | 97 |
| 95-th percentile | 393 |
| Maximum | 5841 |
| Range | 5841 |
| Interquartile range (IQR) | 90 |
Descriptive statistics
| Standard deviation | 204.04721 |
|---|---|
| Coefficient of variation (CV) | 2.1391208 |
| Kurtosis | 83.48438 |
| Mean | 95.388353 |
| Median Absolute Deviation (MAD) | 27 |
| Skewness | 6.9106027 |
| Sum | 2915545 |
| Variance | 41635.262 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 1675 | 3.2% |
| 0 | 1553 | 3.0% |
| 2 | 1189 | 2.3% |
| 3 | 876 | 1.7% |
| 4 | 818 | 1.6% |
| 8 | 613 | 1.2% |
| 5 | 559 | 1.1% |
| 6 | 557 | 1.1% |
| 7 | 546 | 1.0% |
| 10 | 497 | 0.9% |
| Other values (1089) | 21682 | |
| (Missing) | 21760 |
| Value | Count | Frequency (%) |
| 0 | 1553 | |
| 1 | 1675 | |
| 2 | 1189 | |
| 3 | 876 | |
| 4 | 818 | |
| 5 | 559 | 1.1% |
| 6 | 557 | 1.1% |
| 7 | 546 | 1.0% |
| 8 | 613 | 1.2% |
| 9 | 466 | 0.9% |
| Value | Count | Frequency (%) |
| 5841 | 1 | |
| 4196 | 1 | |
| 4185 | 1 | |
| 3631 | 1 | |
| 3525 | 1 | |
| 3457 | 1 | |
| 3330 | 1 | |
| 3284 | 1 | |
| 3280 | 1 | |
| 3205 | 1 |
TIGO
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 422 |
|---|---|
| Distinct (%) | 2.0% |
| Missing | 31265 |
| Missing (%) | 59.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 23.246819 |
| Minimum | 0 |
|---|---|
| Maximum | 2663 |
| Zeros | 2234 |
| Zeros (%) | 4.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 408.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2 |
| median | 6 |
| Q3 | 20 |
| 95-th percentile | 96 |
| Maximum | 2663 |
| Range | 2663 |
| Interquartile range (IQR) | 18 |
Descriptive statistics
| Standard deviation | 63.873315 |
|---|---|
| Coefficient of variation (CV) | 2.7476153 |
| Kurtosis | 275.73245 |
| Mean | 23.246819 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 12.070005 |
| Sum | 489578 |
| Variance | 4079.8003 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 2786 | 5.3% |
| 0 | 2234 | 4.3% |
| 2 | 1813 | 3.5% |
| 3 | 1314 | 2.5% |
| 4 | 1085 | 2.1% |
| 5 | 812 | 1.6% |
| 6 | 733 | 1.4% |
| 7 | 630 | 1.2% |
| 8 | 610 | 1.2% |
| 9 | 507 | 1.0% |
| Other values (412) | 8536 | 16.3% |
| (Missing) | 31265 |
| Value | Count | Frequency (%) |
| 0 | 2234 | |
| 1 | 2786 | |
| 2 | 1813 | |
| 3 | 1314 | |
| 4 | 1085 | 2.1% |
| 5 | 812 | 1.6% |
| 6 | 733 | 1.4% |
| 7 | 630 | 1.2% |
| 8 | 610 | 1.2% |
| 9 | 507 | 1.0% |
| Value | Count | Frequency (%) |
| 2663 | 1 | |
| 1651 | 1 | |
| 1594 | 1 | |
| 1512 | 1 | |
| 1476 | 1 | |
| 1379 | 1 | |
| 1374 | 1 | |
| 1317 | 1 | |
| 1240 | 1 | |
| 1160 | 1 |
ZONE1
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 147 |
|---|---|
| Distinct (%) | 3.5% |
| Missing | 48134 |
| Missing (%) | 92.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.7625865 |
| Minimum | 0 |
|---|---|
| Maximum | 1427 |
| Zeros | 1460 |
| Zeros (%) | 2.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 408.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 4 |
| 95-th percentile | 32 |
| Maximum | 1427 |
| Range | 1427 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 44.152445 |
|---|---|
| Coefficient of variation (CV) | 5.0387457 |
| Kurtosis | 376.42957 |
| Mean | 8.7625865 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 16.151948 |
| Sum | 36724 |
| Variance | 1949.4384 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1460 | 2.8% |
| 1 | 1027 | 2.0% |
| 2 | 394 | 0.8% |
| 3 | 235 | 0.4% |
| 4 | 146 | 0.3% |
| 5 | 124 | 0.2% |
| 6 | 81 | 0.2% |
| 7 | 61 | 0.1% |
| 9 | 60 | 0.1% |
| 8 | 54 | 0.1% |
| Other values (137) | 549 | 1.0% |
| (Missing) | 48134 |
| Value | Count | Frequency (%) |
| 0 | 1460 | |
| 1 | 1027 | |
| 2 | 394 | 0.8% |
| 3 | 235 | 0.4% |
| 4 | 146 | 0.3% |
| 5 | 124 | 0.2% |
| 6 | 81 | 0.2% |
| 7 | 61 | 0.1% |
| 8 | 54 | 0.1% |
| 9 | 60 | 0.1% |
| Value | Count | Frequency (%) |
| 1427 | 1 | |
| 963 | 1 | |
| 820 | 1 | |
| 673 | 1 | |
| 627 | 1 | |
| 554 | 1 | |
| 506 | 1 | |
| 463 | 1 | |
| 422 | 1 | |
| 406 | 1 |
ZONE2
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 110 |
|---|---|
| Distinct (%) | 3.3% |
| Missing | 48965 |
| Missing (%) | 93.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.5291667 |
| Minimum | 0 |
|---|---|
| Maximum | 932 |
| Zeros | 989 |
| Zeros (%) | 1.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 408.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 2 |
| Q3 | 5 |
| 95-th percentile | 29 |
| Maximum | 932 |
| Range | 932 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 28.504446 |
|---|---|
| Coefficient of variation (CV) | 3.78587 |
| Kurtosis | 412.87348 |
| Mean | 7.5291667 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 16.335985 |
| Sum | 25298 |
| Variance | 812.50347 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 989 | 1.9% |
| 1 | 659 | 1.3% |
| 2 | 366 | 0.7% |
| 3 | 209 | 0.4% |
| 4 | 191 | 0.4% |
| 5 | 129 | 0.2% |
| 6 | 101 | 0.2% |
| 7 | 69 | 0.1% |
| 8 | 67 | 0.1% |
| 9 | 61 | 0.1% |
| Other values (100) | 519 | 1.0% |
| (Missing) | 48965 |
| Value | Count | Frequency (%) |
| 0 | 989 | |
| 1 | 659 | |
| 2 | 366 | 0.7% |
| 3 | 209 | 0.4% |
| 4 | 191 | 0.4% |
| 5 | 129 | 0.2% |
| 6 | 101 | 0.2% |
| 7 | 69 | 0.1% |
| 8 | 67 | 0.1% |
| 9 | 61 | 0.1% |
| Value | Count | Frequency (%) |
| 932 | 1 | |
| 541 | 1 | |
| 527 | 1 | |
| 279 | 1 | |
| 258 | 1 | |
| 252 | 1 | |
| 246 | 1 | |
| 235 | 1 | |
| 231 | 1 | |
| 215 | 2 |
MRG
Boolean
CONSTANT 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 102.3 KiB |
| False | |
|---|---|
| (Missing) | 1 |
| Value | Count | Frequency (%) |
| False | 52324 | |
| (Missing) | 1 | < 0.1% |
REGULARITY
Real number (ℝ)
| Distinct | 62 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 28.077785 |
| Minimum | 1 |
|---|---|
| Maximum | 62 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 408.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 6 |
| median | 24 |
| Q3 | 51 |
| 95-th percentile | 62 |
| Maximum | 62 |
| Range | 61 |
| Interquartile range (IQR) | 45 |
Descriptive statistics
| Standard deviation | 22.330879 |
|---|---|
| Coefficient of variation (CV) | 0.79532196 |
| Kurtosis | -1.490132 |
| Mean | 28.077785 |
| Median Absolute Deviation (MAD) | 20 |
| Skewness | 0.24649481 |
| Sum | 1469142 |
| Variance | 498.66815 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 4737 | 9.1% |
| 62 | 4124 | 7.9% |
| 2 | 2894 | 5.5% |
| 3 | 2059 | 3.9% |
| 4 | 1680 | 3.2% |
| 61 | 1562 | 3.0% |
| 5 | 1396 | 2.7% |
| 6 | 1292 | 2.5% |
| 60 | 1207 | 2.3% |
| 7 | 1105 | 2.1% |
| Other values (52) | 30268 |
| Value | Count | Frequency (%) |
| 1 | 4737 | |
| 2 | 2894 | |
| 3 | 2059 | |
| 4 | 1680 | 3.2% |
| 5 | 1396 | 2.7% |
| 6 | 1292 | 2.5% |
| 7 | 1105 | 2.1% |
| 8 | 990 | 1.9% |
| 9 | 864 | 1.7% |
| 10 | 822 | 1.6% |
| Value | Count | Frequency (%) |
| 62 | 4124 | |
| 61 | 1562 | 3.0% |
| 60 | 1207 | 2.3% |
| 59 | 1000 | 1.9% |
| 58 | 808 | 1.5% |
| 57 | 772 | 1.5% |
| 56 | 703 | 1.3% |
| 55 | 635 | 1.2% |
| 54 | 615 | 1.2% |
| 53 | 614 | 1.2% |
TOP_PACK
Text
MISSING 
| Distinct | 88 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 21963 |
| Missing (%) | 42.0% |
| Memory size | 408.9 KiB |
Length
| Max length | 49 |
|---|---|
| Median length | 42 |
| Mean length | 23.180324 |
| Min length | 9 |
Characters and Unicode
| Total characters | 703801 |
|---|---|
| Distinct characters | 70 |
| Distinct categories | 11 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 10 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | On net 200F=Unlimited _call24H |
|---|---|
| 2nd row | On-net 1000F=10MilF;10d |
| 3rd row | Data:1000F=5GB,7d |
| 4th row | Mixt 250F=Unlimited_call24H |
| 5th row | MIXT:500F= 2500F on net _2500F off net;2d |
| Value | Count | Frequency (%) |
| all-net | 9254 | 12.2% |
| 500f=2000f;5d | 7543 | 9.9% |
| net | 6260 | 8.3% |
| on | 5824 | 7.7% |
| 200f=unlimited | 3726 | 4.9% |
| call24h | 3726 | 4.9% |
| 2500f | 3242 | 4.3% |
| data | 3150 | 4.2% |
| data:490f=1gb,7d | 2876 | 3.8% |
| mixt | 2202 | 2.9% |
| Other values (121) | 28053 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 98753 | 14.0% |
| 45494 | 6.5% | |
| l | 42419 | 6.0% |
| F | 40898 | 5.8% |
| t | 39233 | 5.6% |
| n | 35161 | 5.0% |
| 2 | 33085 | 4.7% |
| e | 28318 | 4.0% |
| a | 27364 | 3.9% |
| 5 | 26640 | 3.8% |
| Other values (60) | 286436 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 252515 | |
| Decimal Number | 194888 | |
| Uppercase Letter | 124287 | |
| Space Separator | 45494 | 6.5% |
| Other Punctuation | 32013 | 4.5% |
| Math Symbol | 26597 | 3.8% |
| Connector Punctuation | 15824 | 2.2% |
| Dash Punctuation | 11141 | 1.6% |
| Close Punctuation | 434 | 0.1% |
| Open Punctuation | 434 | 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 40898 | |
| D | 11476 | 9.2% |
| A | 10929 | 8.8% |
| H | 9495 | 7.6% |
| B | 8301 | 6.7% |
| M | 8009 | 6.4% |
| U | 7550 | 6.1% |
| O | 5894 | 4.7% |
| G | 5510 | 4.4% |
| I | 3337 | 2.7% |
| Other values (15) | 12888 | 10.4% |
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 42419 | |
| t | 39233 | |
| n | 35161 | |
| e | 28318 | |
| a | 27364 | |
| d | 24236 | |
| i | 19679 | |
| o | 8054 | 3.2% |
| m | 7723 | 3.1% |
| c | 6185 | 2.4% |
| Other values (12) | 14143 | 5.6% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 98753 | |
| 2 | 33085 | 17.0% |
| 5 | 26640 | 13.7% |
| 4 | 15380 | 7.9% |
| 1 | 10753 | 5.5% |
| 3 | 3377 | 1.7% |
| 7 | 3300 | 1.7% |
| 9 | 3015 | 1.5% |
| 6 | 481 | 0.2% |
| 8 | 104 | 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 11502 | |
| : | 11055 | |
| , | 9388 | |
| . | 57 | 0.2% |
| / | 11 | < 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 26542 | |
| + | 55 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| 45494 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 15824 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 11141 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 434 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 434 |
Control
| Value | Count | Frequency (%) |
| 174 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 376802 | |
| Common | 326999 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 42419 | |
| F | 40898 | |
| t | 39233 | |
| n | 35161 | 9.3% |
| e | 28318 | 7.5% |
| a | 27364 | 7.3% |
| d | 24236 | 6.4% |
| i | 19679 | 5.2% |
| D | 11476 | 3.0% |
| A | 10929 | 2.9% |
| Other values (37) | 97089 |
Common
| Value | Count | Frequency (%) |
| 0 | 98753 | |
| 45494 | ||
| 2 | 33085 | 10.1% |
| 5 | 26640 | 8.1% |
| = | 26542 | 8.1% |
| _ | 15824 | 4.8% |
| 4 | 15380 | 4.7% |
| ; | 11502 | 3.5% |
| - | 11141 | 3.4% |
| : | 11055 | 3.4% |
| Other values (13) | 31583 | 9.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 703801 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 98753 | 14.0% |
| 45494 | 6.5% | |
| l | 42419 | 6.0% |
| F | 40898 | 5.8% |
| t | 39233 | 5.6% |
| n | 35161 | 5.0% |
| 2 | 33085 | 4.7% |
| e | 28318 | 4.0% |
| a | 27364 | 3.9% |
| 5 | 26640 | 3.8% |
| Other values (60) | 286436 |
FREQ_TOP_PACK
Real number (ℝ)
MISSING 
| Distinct | 116 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 21963 |
| Missing (%) | 42.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.3510309 |
| Minimum | 1 |
|---|---|
| Maximum | 174 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 408.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 5 |
| Q3 | 12 |
| 95-th percentile | 33 |
| Maximum | 174 |
| Range | 173 |
| Interquartile range (IQR) | 10 |
Descriptive statistics
| Standard deviation | 12.190022 |
|---|---|
| Coefficient of variation (CV) | 1.303602 |
| Kurtosis | 14.252226 |
| Mean | 9.3510309 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 3.0509684 |
| Sum | 283916 |
| Variance | 148.59664 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 6068 | 11.6% |
| 2 | 3705 | 7.1% |
| 3 | 2835 | 5.4% |
| 4 | 2080 | 4.0% |
| 5 | 1678 | 3.2% |
| 6 | 1398 | 2.7% |
| 7 | 1191 | 2.3% |
| 8 | 1119 | 2.1% |
| 9 | 917 | 1.8% |
| 10 | 822 | 1.6% |
| Other values (106) | 8549 | 16.3% |
| (Missing) | 21963 |
| Value | Count | Frequency (%) |
| 1 | 6068 | |
| 2 | 3705 | |
| 3 | 2835 | |
| 4 | 2080 | 4.0% |
| 5 | 1678 | 3.2% |
| 6 | 1398 | 2.7% |
| 7 | 1191 | 2.3% |
| 8 | 1119 | 2.1% |
| 9 | 917 | 1.8% |
| 10 | 822 | 1.6% |
| Value | Count | Frequency (%) |
| 174 | 1 | |
| 151 | 1 | |
| 139 | 1 | |
| 136 | 2 | |
| 126 | 1 | |
| 125 | 2 | |
| 124 | 1 | |
| 122 | 1 | |
| 121 | 2 | |
| 120 | 1 |
CHURN
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 408.9 KiB |
| 0.0 | |
|---|---|
| 1.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 156972 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 42486 | |
| 1.0 | 9838 | 18.8% |
| (Missing) | 1 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 42486 | |
| 1.0 | 9838 | 18.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 94810 | |
| . | 52324 | |
| 1 | 9838 | 6.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 104648 | |
| Other Punctuation | 52324 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 94810 | |
| 1 | 9838 | 9.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 52324 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 156972 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 94810 | |
| . | 52324 | |
| 1 | 9838 | 6.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 156972 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 94810 | |
| . | 52324 | |
| 1 | 9838 | 6.3% |
| user_id | REGION | TENURE | MONTANT | FREQUENCE_RECH | REVENUE | ARPU_SEGMENT | FREQUENCE | DATA_VOLUME | ON_NET | ORANGE | TIGO | ZONE1 | ZONE2 | MRG | REGULARITY | TOP_PACK | FREQ_TOP_PACK | CHURN | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 00000bfd7d50f01092811bc0c8d7b0d6fe7c3596 | FATICK | K > 24 month | 4250.0 | 15.0 | 4251.0 | 1417.0 | 17.0 | 4.0 | 388.0 | 46.0 | 1.0 | 1.0 | 2.0 | NO | 54.0 | On net 200F=Unlimited _call24H | 8.0 | 0.0 |
| 1 | 00000cb4a5d760de88fecb38e2f71b7bec52e834 | NaN | I 18-21 month | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NO | 4.0 | NaN | NaN | 1.0 |
| 2 | 00001654a9d9f96303d9969d0a4a851714a4bb57 | NaN | K > 24 month | 3600.0 | 2.0 | 1020.0 | 340.0 | 2.0 | NaN | 90.0 | 46.0 | 7.0 | NaN | NaN | NO | 17.0 | On-net 1000F=10MilF;10d | 1.0 | 0.0 |
| 3 | 00001dd6fa45f7ba044bd5d84937be464ce78ac2 | DAKAR | K > 24 month | 13500.0 | 15.0 | 13502.0 | 4501.0 | 18.0 | 43804.0 | 41.0 | 102.0 | 2.0 | NaN | NaN | NO | 62.0 | Data:1000F=5GB,7d | 11.0 | 0.0 |
| 4 | 000028d9e13a595abe061f9b58f3d76ab907850f | DAKAR | K > 24 month | 1000.0 | 1.0 | 985.0 | 328.0 | 1.0 | NaN | 39.0 | 24.0 | NaN | NaN | NaN | NO | 11.0 | Mixt 250F=Unlimited_call24H | 2.0 | 0.0 |
| 5 | 0000296564272665ccd2925d377e124f3306b01e | LOUGA | K > 24 month | 8500.0 | 17.0 | 9000.0 | 3000.0 | 18.0 | NaN | 252.0 | 70.0 | 91.0 | NaN | NaN | NO | 62.0 | MIXT:500F= 2500F on net _2500F off net;2d | 18.0 | 0.0 |
| 6 | 00002b0ed56e2c199ec8c3021327229afa70f063 | LOUGA | K > 24 month | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NO | 2.0 | NaN | NaN | 0.0 |
| 7 | 0000313946b6849745963442c6e572d47cd24ced | DAKAR | K > 24 month | 7000.0 | 16.0 | 7229.0 | 2410.0 | 22.0 | 1601.0 | 77.0 | 29.0 | 100.0 | NaN | NaN | NO | 55.0 | All-net 500F=2000F;5d | 8.0 | 0.0 |
| 8 | 0000398021ccd3a488fa1a63dee3b2f0d471f9fd | DAKAR | K > 24 month | 1500.0 | 3.0 | 1502.0 | 501.0 | 12.0 | NaN | 2.0 | 53.0 | 2.0 | NaN | NaN | NO | 31.0 | NaN | NaN | 0.0 |
| 9 | 00003d165737109921ebd21f883cb8cff028b626 | TAMBACOUNDA | K > 24 month | 4000.0 | 8.0 | 4000.0 | 1333.0 | 8.0 | NaN | 1620.0 | 9.0 | NaN | NaN | NaN | NO | 45.0 | On-net 500F_FNF;3d | 8.0 | 0.0 |
| user_id | REGION | TENURE | MONTANT | FREQUENCE_RECH | REVENUE | ARPU_SEGMENT | FREQUENCE | DATA_VOLUME | ON_NET | ORANGE | TIGO | ZONE1 | ZONE2 | MRG | REGULARITY | TOP_PACK | FREQ_TOP_PACK | CHURN | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 52315 | 064b0cca8eca9cb5421887eb5135f73d28c483bd | KAOLACK | K > 24 month | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NO | 25.0 | NaN | NaN | 0.0 |
| 52316 | 064b1353e7696690d4210f2ffb5e4063f7aa84c6 | DAKAR | K > 24 month | 500.0 | 1.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NO | 5.0 | NaN | NaN | 0.0 |
| 52317 | 064b15946a12f77fa9041f12de5c171b8c9ef93e | THIES | K > 24 month | 2100.0 | 6.0 | 2580.0 | 860.0 | 7.0 | 1208.0 | 718.0 | NaN | NaN | NaN | NaN | NO | 30.0 | On-net 500F_FNF;3d | 3.0 | 0.0 |
| 52318 | 064b1c1dbafa39581ea78d8c62acbd3264b224e2 | KAOLACK | K > 24 month | NaN | NaN | NaN | NaN | NaN | 487.0 | 5.0 | NaN | NaN | NaN | NaN | NO | 9.0 | NaN | NaN | 0.0 |
| 52319 | 064b1cbdd5b7f611299d0b5bdad5f9e0c50e24df | DAKAR | K > 24 month | 3000.0 | 6.0 | 3000.0 | 1000.0 | 6.0 | NaN | 183.0 | 1.0 | NaN | NaN | NaN | NO | 23.0 | All-net 500F=2000F;5d | 4.0 | 0.0 |
| 52320 | 064b1e66f1a980abd4ac90846de4e51c857f85d9 | NaN | K > 24 month | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NO | 3.0 | NaN | NaN | 1.0 |
| 52321 | 064b2516ec20180480c45e0c9c1e5847f2dac762 | THIES | K > 24 month | 1200.0 | 2.0 | 1198.0 | 399.0 | 2.0 | NaN | 43.0 | 19.0 | NaN | NaN | 1.0 | NO | 7.0 | All-net 500F=2000F;5d | 2.0 | 0.0 |
| 52322 | 064b25af59754420bcf31661e4e31585739335b8 | NaN | K > 24 month | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NO | 4.0 | NaN | NaN | 0.0 |
| 52323 | 064b2de9946e44fdbd35a5cb584ac4567edbd1b8 | SAINT-LOUIS | K > 24 month | 100.0 | 1.0 | 100.0 | 33.0 | 1.0 | NaN | NaN | NaN | NaN | NaN | NaN | NO | 3.0 | Data: 100 F=40MB,24H | 1.0 | 0.0 |
| 52324 | 064b3105f50868fc36e5ed2679e98dcf992a9c48 | ZIGUINCHOR | K > | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |